4 results for CHD Prediction, Blood Serum Data Chemometrics Methods

in Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco


Relevance:

100.00%

Publisher:

Abstract:

In real-life data sets, parts of the data are often unavailable. This can happen for various reasons, which give rise to different patterns. In the literature, this problem is known as Missing Data. It can be handled in various ways: discarding incomplete observations, estimating what the missing values originally were, or simply ignoring the fact that some values are missing. The methods used to estimate missing data are called Imputation Methods. The work presented in this thesis has two main goals. The first is to determine whether any interaction exists between Missing Data, Imputation Methods and Supervised Classification algorithms when they are applied together. For this first problem we consider a scenario in which the databases are discrete, meaning that no relation between observations is assumed. These datasets underwent processes involving different combinations of the three components mentioned. The results showed that the missing data pattern strongly influences the outcome produced by a classifier. Also, in some cases, the complex imputation techniques investigated in the thesis obtained better results than simple ones. The second goal of this work is to propose a new imputation strategy, this time restricting the previous problem to a special kind of dataset, multivariate Time Series. We designed new imputation techniques for this particular domain and combined them with some of the strategies tested in the previous chapter of this thesis. The time series were also subjected to processes involving missing data and imputation, in order to finally propose an overall better imputation method. In the final chapter of this work, a real-world example is presented, describing a water quality prediction problem. The databases that characterized this problem had their own original latent values, which provides a real-world benchmark to test the algorithms developed in this thesis.
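As a rough illustration of two of the options the abstract mentions (discarding incomplete observations versus estimating the missing values), the following minimal sketch contrasts listwise deletion with column-mean imputation. It is not the thesis's method: the function names, the use of `None` as the missing-value marker, and the toy data are all assumptions made for this example.

```python
from statistics import mean


def drop_incomplete(rows):
    """Listwise deletion: keep only rows that contain no missing (None) value."""
    return [row for row in rows if all(v is not None for v in row)]


def impute_column_means(rows):
    """Mean imputation: replace each None with the mean of the observed
    values in the same column. Assumes every column has at least one
    observed value (statistics.mean raises on an empty list)."""
    n_cols = len(rows[0])
    col_means = []
    for j in range(n_cols):
        observed = [row[j] for row in rows if row[j] is not None]
        col_means.append(mean(observed))
    return [
        [col_means[j] if v is None else v for j, v in enumerate(row)]
        for row in rows
    ]


# Hypothetical toy data: two features, two incomplete observations.
data = [
    [1.0, 10.0],
    [2.0, None],
    [None, 30.0],
    [4.0, 40.0],
]

complete_only = drop_incomplete(data)   # two rows survive
imputed = impute_column_means(data)     # four rows survive, gaps filled
```

Deletion halves the toy data set, while imputation keeps every observation at the cost of inserting estimated values; a downstream classifier would see different training data in each case.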

Relevance:

100.00%

Publisher:

Abstract:

Spurious oscillations are one of the principal issues faced by microwave and RF circuit designers. The rigorous detection of instabilities and the characterization of measured spurious oscillations remain an ongoing challenge. This project aims to create a new stability analysis CAD program that tackles this challenge. Multiple Input Multiple Output (MIMO) pole-zero identification analysis is introduced in the program as a way to create new methods that automate the stability analysis process, help designers understand the obtained results and prevent incorrect interpretations. The MIMO nature of the analysis helps eliminate possible losses of controllability and observability, and helps differentiate mathematical from physical quasi-cancellations, which are products of overmodeling. The program reads Single Input Single Output (SISO) or MIMO frequency response data and determines the corresponding continuous transfer functions with Vector Fitting. Once the transfer function is calculated, the corresponding pole/zero diagram is plotted, enabling designers to analyze the stability of an amplifier. Three data processing methods are introduced: two consist of pole/zero eliminations, and the third determines the critical nodes of an amplifier. The first pole/zero elimination method eliminates non-resonant poles, whilst the second eliminates poles with small residues, on the assumption that their effect on the dynamics of the system is small or non-existent. The critical node detection is also based on the residues: the node at which the effect of a pole on the dynamics is highest is defined as the critical node. To evaluate the created program, it is compared through examples with an existing commercial stability analysis tool (STAN tool). In this report, the newly created tool is shown to be as rigorous as STAN in detecting instabilities. Additionally, the MIMO analysis is found to be a valuable addition to stability analysis, since it helps to eliminate possible problems of loss of controllability, loss of observability and overmodeling.
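The residue-based steps described in the abstract can be sketched as follows, assuming the poles and residues have already been identified (for example by Vector Fitting, which is not implemented here). This is an illustrative sketch only, not the program's actual code: the function names, the relative threshold, the node names and all numeric values are hypothetical. The instability check relies on the standard criterion that a continuous-time system is unstable when a pole has a positive real part.

```python
def is_unstable(poles):
    """A continuous-time transfer function is unstable if any pole
    lies in the right half of the complex plane (positive real part)."""
    return any(p.real > 0 for p in poles)


def prune_small_residues(poles, residues, rel_threshold=1e-3):
    """Keep only (pole, residue) pairs whose residue magnitude is at least
    rel_threshold times the largest residue magnitude.
    The threshold value is an assumption made for this sketch."""
    largest = max(abs(r) for r in residues)
    return [
        (p, r)
        for p, r in zip(poles, residues)
        if abs(r) >= rel_threshold * largest
    ]


def critical_node(residues_by_node, pole_index):
    """Return the node at which the chosen pole has the largest residue
    magnitude, i.e. where its effect on the dynamics is highest."""
    return max(
        residues_by_node,
        key=lambda node: abs(residues_by_node[node][pole_index]),
    )


# Hypothetical identified poles (rad/s) and residues at one observation node.
poles = [
    complex(-1.0, 5.0), complex(-1.0, -5.0),   # stable resonant pair
    complex(0.2, 3.0), complex(0.2, -3.0),     # unstable resonant pair
    complex(-50.0, 0.0),                       # real pole, tiny residue
]
residues = [1.0 + 0j, 1.0 + 0j, 0.5 + 0j, 0.5 + 0j, 1e-6 + 0j]

kept = prune_small_residues(poles, residues)
unstable = is_unstable([p for p, _ in kept])

# Residues of the same poles observed at two hypothetical circuit nodes.
residues_by_node = {
    "gate": [1.0, 1.0, 0.05, 0.05, 1e-6],
    "drain": [0.8, 0.8, 0.5, 0.5, 1e-6],
}
node = critical_node(residues_by_node, pole_index=2)
```

In this toy case the real pole is pruned, the remaining set still contains the right-half-plane pair, and the unstable pole at index 2 is most visible at the "drain" node.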

Relevance:

50.00%

Publisher:

Abstract:

In the problem of one-class classification (OCC), one of the classes, the target class, has to be distinguished from all other possible objects, which are considered nontargets. This situation arises in many biomedical problems, for example in diagnosis, image-based tumor recognition or analysis of electrocardiogram data. In this paper, an approach to OCC based on a typicality test is experimentally compared with reference state-of-the-art OCC techniques (Gaussian, mixture of Gaussians, naive Parzen, Parzen, and support vector data description) using biomedical data sets. We evaluate the procedures using twelve experimental data sets whose data are not necessarily continuous. As there are few benchmark data sets for one-class classification, all data sets considered in the evaluation have multiple classes. Each class in turn is considered as the target class, and the units in the other classes are considered as new units to be classified. The results of the comparison show the good performance of the typicality approach, which is applicable to high-dimensional data. It is worth mentioning that it can be used for any kind of data (continuous, discrete, or nominal), whereas applying state-of-the-art approaches is not straightforward when nominal variables are present.
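To make the OCC setting concrete, here is a minimal sketch in the spirit of the Gaussian reference technique named above, simplified to independent features (a diagonal covariance). It is not the paper's typicality approach and not the exact Gaussian method evaluated there. The class name, the `reject_fraction` parameter and the toy data are assumptions for illustration.

```python
from statistics import mean, pstdev


class DiagonalGaussianOCC:
    """One-class classifier: fit a per-feature Gaussian to target-class
    units only, then accept new units whose squared standardized distance
    does not exceed a threshold learned from the training units."""

    def fit(self, targets, reject_fraction=0.1):
        columns = list(zip(*targets))
        self.means = [mean(c) for c in columns]
        # Fall back to 1.0 to avoid dividing by zero on constant features.
        self.stds = [pstdev(c) or 1.0 for c in columns]
        scores = sorted(self.score(x) for x in targets)
        # Threshold at roughly the (1 - reject_fraction) quantile of scores.
        idx = min(len(scores) - 1, int((1 - reject_fraction) * len(scores)))
        self.threshold = scores[idx]
        return self

    def score(self, x):
        """Sum of squared z-scores; larger means less typical of the target."""
        return sum(
            ((v - m) / s) ** 2 for v, m, s in zip(x, self.means, self.stds)
        )

    def is_target(self, x):
        return self.score(x) <= self.threshold


# Hypothetical target-class units with two continuous features.
targets = [
    [0.0, 0.0], [1.0, 1.0], [0.5, 0.2],
    [0.2, 0.8], [0.9, 0.1], [0.4, 0.6],
]
model = DiagonalGaussianOCC().fit(targets)

near = model.is_target([0.5, 0.5])    # close to the training units
far = model.is_target([10.0, 10.0])   # far from the training units
```

Only target-class units are used for fitting, which mirrors the evaluation protocol in the abstract: one class is treated as the target, and units from the other classes are presented as new units to be accepted or rejected. A model of this kind needs numeric features, which illustrates why nominal variables are awkward for Gaussian-type approaches.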